Day 1: Core AI concepts
Distributional Impact of Policies. Fiscal Policy and Growth Department
2026-04-20
π» IDE: Integrated Development Environment, e.g., Positron, RStudio, VS Code, and more.
π€ AI assistants:
π Context: information that AI uses to generate responses.
π’ Tokens: pieces of text that LLMs process.
β Requests: prompts, instructions.
π§ LLMs: Large Language Models, e.g., GPT, Claude, and more.
β Hallucinations: when AI generates incorrect or fabricated information.
π§βπ€βπ§ Agents, skills, tools, hooks, plugins, and more (for Day 2).
An IDE is the workspace for coding and executing data analysis:
There are many IDEs, both with user interfaces (UIs) and as command-line interfaces (CLIs).
Most are made for programming. Positron, RStudio, and JupyterLab β for data science.






Those are: Positron | VS Code | Cursor | Claude Code | and moreβ¦
π» Code, data, and analysis stay inside the IDE on the userβs computer.
Positron Assistant works in the IDE as a middleman that connects the user, IDE, and cloud-based AI services.
GitHub Copilot is a cloud-based AI orchestrator: coordinates the assistant and LLMs, submits prompts to the LLMs, and pre-processes responses before sending them back to the assistant.
π’ The LLM:
β The user controls and decides whether to accept, edit, or reject the suggested code.
Be mindful of context size and content.
β οΈ LLMs have a maximum context window (4kβ2M tokens) β exceeding it leads to information loss.
β οΈ When the context window fills up, old context is pushed out, causing information loss and reduced quality.
ποΈ Assistants may compress context via LLM summarization β but this is suboptimal: key details can be lost.
Best practices:
πΎ Data/secrets in console β context exposed to model
π LLM may include data files (csv, json, txt) β data exposed to model
π Source code β code is context, can leak
π₯ Insecure/destructive actions β always review before accepting
Be cautious with sensitive data. Review all suggestions. Ask the LLM to explain its suggestions. Follow recommendations to safeguard data
Use AI responsibly and securely: https://ai.worldbank.org/risk-mitigation


From Neural Networks Series by 3Blue1Brown and NN-Full Video Course


From Neural Networks Series by 3Blue1Brown and NN-Full Video Course
Next-token prediction example
From Neural Networks Series by 3Blue1Brown and NN-Full Video Course
Context-dependent LLM prediction
From Neural Networks Series by 3Blue1Brown and NN-Full Video Course
LLM inference diagram
Adapted from 10.48550/arXiv.2408.02549
Hallucinations occur when an LLM generates text that is fluent, confident, and wrong. Common causes: insufficient or irrelevant context, training data biases or gaps, and over-generalization of learned patterns.
Hallucination patterns
Mitigation:
We will focus Day 2 on advanced AI features that enrich context and mitigate hallucinations:
π§ Tools β connect LLMs to your file system, APIs, applications, databases, and the Internet.
π Prompt files and custom instructions β pre-load specific context and steer or template LLM responses.
πΊοΈ Plan mode β break down complex tasks into smaller steps, discuss implementation with the LLM, and execute an actionable plan.
π§© Skills β knowledge that AI plugs into context when it encounters a specific problem.
π€ Agents β AI-driven programs that autonomously execute tasks by orchestrating tools, skills, and APIs sequentially.
Check out:
worldbank.github.io/ai4coding Β© 2026 World Bank.